A Sinusoidal Noise Model Based Speech Synthesis For Phoneme Transition
نویسندگان
چکیده
One well-known problem with speech synthesis is the occurrence of audible discontinuities at phoneme boundaries, which lead to the unnaturalness of synthetic speech. This paper presents a sinusoidal noise based mathematical method to reform the transition regions from one phoneme to another phoneme with low storage. The speech parameters of sinusoidal noise model were estimated and stored as polynomials to reconstruct the transition wave. According to the results, all transitions regions which are considered during this experiment have higher correlation values for lower order polynomial with less capacity ratio. In addition, to that the same experiment has been carried out by changing the number of FFT coefficient. As the FFT coefficient increases, capacity ratio was also increased, while correlation coefficient values were also increased. It was understood that a signal which is very close to the original signal can be generated with a lesser number of FFT coefficients
منابع مشابه
Polynomial quasi-harmonic models for speech analysis and synthesis
Harmonic plus noise models have been successfully applied to a broad range of speech processing applications, including, among others, low bit-rate speech coding, and speech restoration and transformation. In conventional methods, the frequencies, the relative phases and the amplitudes of the pitch-harmonic components are assumed to be piecewise constants over an analysis frame. This assumption...
متن کاملAllophone-based acoustic modeling for Persian phoneme recognition
Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...
متن کاملAccurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis
We present here an Harmonic+Noise Model (HNM) for speech synthesis. The noise part is represented by an autoregressive model whose output is pitchsynchronously modulated in energy. The harmonic part of the signal is represented by a sinusoidal model. This paper compares di erent methods for separating these two components. We then propose a method for the estimation of the sinusoidal parameters...
متن کاملImproving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM
Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...
متن کاملParameterizing Speech Phonemes by Exponential Sinusoidal Model
The exponential sinusoidal model (ESM) for parameterizing speech signals is proposed in this paper. The main feature of the ESM is that the amplitude of each sinusoidal component is allowed to vary exponentially with time. A novel variable segmentation strategy is applied first to separate individual transients of a voiced phoneme, which can then be fitted with the ESM. The epoch of a transient...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014